Goto

Collaborating Authors

 neural processing unit


UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units

Neural Information Processing Systems

Deploying TinyML models on low-cost IoT hardware is very challenging, due to limited device memory capacity. Neural processing unit (NPU) hardware address the memory challenge by using model compression to exploit weight quantization and sparsity to fit more parameters in the same footprint. However, designing compressible neural networks (NNs) is challenging, as it expands the design space across which we must make balanced trade-offs. This paper demonstrates Unified DNAS for Compressible (UDC) NNs, which explores a large search space to generate state-of-the-art compressible NNs for NPU. ImageNet results show UDC networks are up to 3.35x smaller (iso-accuracy) or 6.25% more accurate (iso-model size) than previous work.


Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers

Fanariotis, Anastasios, Orphanoudakis, Theofanis, Fotopoulos, Vasilis

arXiv.org Artificial Intelligence

The deployment of machine learning (ML) models on microcontrollers (MCUs) is constrained by strict energy, latency, and memory requirements, particularly in battery - operated and real - time edge devices. While software - level optimizations such as quantizatio n and pruning reduce model size and computation, hardware acceleration has emerged as a decisive enabler for efficient embedded inference. This paper evaluates the impact of Neural Processing Units (NPUs) on MCU - based ML execution, using the ARM Cortex - M55 core combined with the Ethos - U55 NPU on the Alif Semiconductor Ensemble E7 development board as a representative platform. A rigorous measurement methodology was employed, incorporating per - inference net energy accounting via GPIO - triggered high - resolutio n digital multimeter synchronization and idle - state subtraction, ensuring accurate attribution of energy costs. Experimental results across six representative ML models -- including MiniResNet, MobileNetV2, FD - MobileNet, MNIST, TinyYolo, and SSD - MobileNet -- dem onstrate substantial efficiency gains when inference is offloaded to the NPU. For moderate to large networks, latency improvements ranged from 7 to over 125, with per - inference net energy reductions up to 143 . Notably, the NPU enabled execution of model s unsupported on CPU - only paths, such as SSD - MobileNet, highlighting its functional as well as efficiency advantages. These findings establish NPUs as a cornerstone of energy - aware embedded AI, enabling real - time, power - constrained ML inference at the MCU level.


UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units

Neural Information Processing Systems

Deploying TinyML models on low-cost IoT hardware is very challenging, due to limited device memory capacity. Neural processing unit (NPU) hardware address the memory challenge by using model compression to exploit weight quantization and sparsity to fit more parameters in the same footprint. However, designing compressible neural networks (NNs) is challenging, as it expands the design space across which we must make balanced trade-offs. This paper demonstrates Unified DNAS for Compressible (UDC) NNs, which explores a large search space to generate state-of-the-art compressible NNs for NPU. ImageNet results show UDC networks are up to 3.35x smaller (iso-accuracy) or 6.25% more accurate (iso-model size) than previous work.


10 ways Snapdragon AI PCs are just like Chromebooks

PCWorld

Microsoft has dreamed of competing with Chromebooks for many years, having previously pushed products like Windows 10 S (a limited version of Windows that powered the first Surface Laptop) and Windows 10X (some interesting ideas but was canceled and never released). Today, Snapdragon X Elite PCs run a "full version of Windows" in many ways--but in using one myself, I'm noticing all kinds of similarities between them and Chromebooks. While Microsoft wants to talk up its AI-focused Copilot PC branding, these PCs powered by Snapdragon X Elite and Snapdragon X Plus chips run Windows on Arm--and they bring serious improvements that feel a lot like Chromebooks in various ways. Thanks to Prism, Windows 11 on Arm can run most--but not all--traditional Windows applications that are coded for x86 processors. In other words, most applications will install and "just work."


The AI PC revolution: 18 essential terms you need to know

PCWorld

These effects work in any application that uses your PC's built-in webcam, and it uses the NPU to apply these effects in a power-efficient way without draining your battery too quickly. While Windows Studio Effects are nice to have, I don't think they're reason enough to buy an AI PC, and certainly not a Meteor Lake-based AI PC. Intel's Lunar Lake is what the first AI PCs should have been. As more laptops ship with powerful NPUs, third-party application developers will likely start using them to add powerful AI features to their Windows desktop applications, putting the powerful PC hardware to use. I'm sure that's what Microsoft is hoping for, anyway. If you've gotten this far, congrats! You're now up to speed on all the most important AI PC terminologies, and you should understand enough now to see where all of this might be heading soon.


Copilot PCs explained: AI Windows meets Arm laptops

PCWorld

Microsoft recently announced at its developer conference about artificial intelligence, a new laptop class with significantly more powerful Snapdragon X CPUs, and its new operating system for the Arm architecture all sounds really promising. This is also true in view of the fact that Apple took this step very successfully and consistently years ago. In the meantime, the entire Mac model range has been converted to Arm CPUs from its own production; models with Intel x86 processors are no longer offered. The fact that Microsoft is now announcing a similar step together with the important computer manufacturers Acer, Asus, Dell, HP, Lenovo, and Samsung is therefore definitely a sign of things to come. On the other hand, as a long-time IT observer, it is almost alarming when Microsoft once again announces a "new PC era."


Your 'AI PC' is already obsolete: The curse of early adoption strikes again

PCWorld

"The year of the AI PC" got off to a strange start. All the "AI PCs" sold by manufacturers for the first half of the year are now effectively out of date. They won't be able to run Windows Recall, the Windows Copilot Runtime, or all the other AI features Microsoft showed off for its new Copilot PCs. Microsoft's Copilot PC certification just taught us a valuable lesson in buying PC hardware: Never buy hardware based on the promise of what it might be able to do in the future. Only buy PC hardware because of what it can actually do today.


Acer TravelMate P6 review: Business on a budget

PCWorld

The Acer TravelMate P6 offers excellent value for a business laptop, with long battery life, a surprisingly light weight, and more ports than a typical consumer laptop. The Acer TravelMate P6 is a business laptop through and through. It's packed with ports, delivers long battery life, is surprisingly lightweight, and has a nice matte screen designed to avoid glare in normally uncomfortable lighting conditions. It's a nice and supremely practical piece of hardware, and I'd be happy to get a machine like this from my job. Starting at a retail price of 1,329, it's a bargain as far as business laptops go, especially if a workplace is getting a discount for buying a bunch at once! But if you're just looking to buy a single laptop for your own personal use, a consumer laptop may be better bet.


Intel kicks off the 'AI PC' era with Core Ultra chips

PCWorld

Intel's Core Ultra era begins now! Intel is shipping its first "Meteor Lake" 14th-gen Core Ultra chips in laptops beginning today, ushering in the new "AI PC" era -- as well as actually telling you what's in them and how fast they're expected to be. Though leaked benchmark results, we've known a bit about Intel's Core Ultra product lineup. But now it's official: Intel will offer eleven new mobile Core Ultra processors, both in the "H" high-performance segment and the low-power "U" family. Intel has disbanded the earlier "P-series" lineup.


The Rise of Intelligent Edge Devices with AI Acceleration

#artificialintelligence

The topic of AI is not new and each one of us is benefiting from AI every day, transforming many aspects of our lives. This trend is fueled by edge computing which is providing opportunities to move AI workloads from the Intelligent Cloud to the Intelligent Edge for improved response times and bandwidth savings. In combination with Digital Twins and IoT, there is a strong trend not only in manufacturing but also in other industries to leverage AI/ML analytics for getting better and faster insights for improved Predictive Maintenance and more. The benefit of edge deployments is especially strong when it comes to computer vision models that take large data streams like images or live video as input. With edge computing, these large data streams can now be processed locally at the device / client, eliminating the need for significant bandwidth or privacy concerns associated with streaming into a cloud data center. Edge video analytics systems can execute computer vision and deep-learning algorithms either directly integrated into the camera or with an attached edge computing system.